    pocl: A Performance-Portable OpenCL Implementation

    OpenCL is a standard for parallel programming of heterogeneous systems. The benefits of a common programming standard are clear: multiple vendors can provide support for application descriptions written according to the standard, thus reducing the program porting effort. While the standard brings the obvious benefits of platform portability, the performance portability aspects are largely left to the programmer. The situation is made worse by multiple proprietary vendor implementations with different characteristics and, thus, different required optimization strategies. In this paper, we propose an OpenCL implementation that is both portable and performance portable. At its core is a kernel compiler that can be used to exploit the data parallelism of OpenCL programs on multiple platforms with different parallel hardware styles. The kernel compiler is modularized to perform target-independent parallel region formation separately from the target-specific parallel mapping of the regions, enabling support for various styles of fine-grained parallel resources such as subword SIMD extensions, SIMD datapaths, and static multi-issue. Unlike previous similar techniques that work on the source level, the parallel region formation retains the information about the data parallelism using the LLVM IR and its metadata infrastructure. This data can be exploited by the later generic compiler passes for efficient parallelization. The proposed open source implementation of OpenCL is also platform portable, enabling OpenCL on a wide range of architectures, both already commercialized and still under research. The paper describes how the portability of the implementation is achieved. Our results show that most of the benchmarked applications, when compiled using pocl, were faster than or close to as fast as the best proprietary OpenCL implementation for the platform at hand.
    Comment: This article was published in 2015; it is now openly accessible via arXiv.
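
    To make the data parallelism the kernel compiler works with concrete, below is a minimal sketch of the kind of OpenCL C kernel pocl compiles; the kernel and argument names are illustrative and not taken from the paper. Each work-item handles one element, and an implementation such as pocl can coalesce many independent work-item bodies into wider parallel regions that map onto SIMD lanes or static multi-issue slots.

        // vector_add.cl - an illustrative data-parallel kernel (hypothetical names).
        // One work-item computes one output element; because the elements are
        // independent, the bodies of many work-items can be fused into a single
        // vectorized parallel region by the kernel compiler.
        __kernel void vector_add(__global const float *a,
                                 __global const float *b,
                                 __global float *c,
                                 const unsigned int n)
        {
            size_t gid = get_global_id(0);   // global index of this work-item
            if (gid < n)                     // guard for padded global sizes
                c[gid] = a[gid] + b[gid];
        }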

    LordCore: Energy-Efficient OpenCL-Programmable Software-Defined Radio Coprocessor

    This paper proposes a single instruction multiple data (SIMD) processor which is programmed with the high-level OpenCL language. The low-power processor is customized for executing multiple-input-multiple-output (MIMO) detection algorithms at high performance while consuming very little power, making it suitable for software-defined radio (SDR) applications. The novel combination of SIMD operations on a transport-programmed multicore datapath allows saving power on both the execution front end and the back end, leading to very good energy efficiency with a compiler-programmable design. We demonstrate the feasibility of the architecture with the layered orthogonal lattice detector (LORD) and minimum mean-square-error (MMSE) MIMO detection algorithms, which can be used as a software-defined radio implementation of the 3GPP Long Term Evolution (LTE) Release 11 standard. Compared to other state-of-the-art SDR architectures, the proposed design adds features that improve programmer productivity with an insignificant power and area impact.
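
    As a rough illustration of the kind of OpenCL workload described here, the sketch below implements a generic 2x2 linear MMSE detector, x_hat = (H^H H + sigma^2 I)^(-1) H^H y, with one work-item per received symbol vector. It is a textbook formulation written for this summary under the assumption of unit-power transmit symbols; it is not code from the LordCore paper, and all names are hypothetical.

        // mmse_2x2.cl - illustrative 2x2 linear MMSE MIMO detection (hypothetical code).
        // Complex values are stored as float2: .x = real part, .y = imaginary part.

        float2 cmul(float2 a, float2 b)              // complex multiplication
        {
            return (float2)(a.x * b.x - a.y * b.y, a.x * b.y + a.y * b.x);
        }

        float2 cconj(float2 a)                       // complex conjugate
        {
            return (float2)(a.x, -a.y);
        }

        float2 cdiv(float2 a, float2 b)              // complex division a / b
        {
            float d = b.x * b.x + b.y * b.y;
            return (float2)((a.x * b.x + a.y * b.y) / d, (a.y * b.x - a.x * b.y) / d);
        }

        // One work-item detects one received vector y = H x + n.
        // H is stored row-major, four complex entries per vector: h00 h01 h10 h11.
        __kernel void mmse_detect_2x2(__global const float2 *H,
                                      __global const float2 *y,
                                      __global float2 *x_hat,
                                      const float sigma2)    // noise variance
        {
            size_t i = get_global_id(0);
            float2 h00 = H[4 * i], h01 = H[4 * i + 1];
            float2 h10 = H[4 * i + 2], h11 = H[4 * i + 3];
            float2 y0 = y[2 * i], y1 = y[2 * i + 1];

            // G = H^H * H + sigma2 * I (a 2x2 Hermitian matrix)
            float2 g00 = cmul(cconj(h00), h00) + cmul(cconj(h10), h10) + (float2)(sigma2, 0.0f);
            float2 g01 = cmul(cconj(h00), h01) + cmul(cconj(h10), h11);
            float2 g10 = cconj(g01);
            float2 g11 = cmul(cconj(h01), h01) + cmul(cconj(h11), h11) + (float2)(sigma2, 0.0f);

            // b = H^H * y
            float2 b0 = cmul(cconj(h00), y0) + cmul(cconj(h10), y1);
            float2 b1 = cmul(cconj(h01), y0) + cmul(cconj(h11), y1);

            // x_hat = G^-1 * b using the closed-form 2x2 matrix inverse.
            float2 det = cmul(g00, g11) - cmul(g01, g10);
            x_hat[2 * i]     = cdiv(cmul(g11, b0) - cmul(g01, b1), det);
            x_hat[2 * i + 1] = cdiv(cmul(g00, b1) - cmul(g10, b0), det);
        }

    Each work-item is independent of the others, so the same kernel body can be mapped across the lanes of a wide SIMD datapath, which is the kind of per-vector data parallelism an OpenCL-programmable SDR coprocessor targets.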

    Power Optimizations for Transport Triggered SIMD Processors

    Power consumption is a key aspect of modern processor design. Optimizing a processor for power leads to direct savings in battery energy consumption in mobile devices. At the same time, many mobile applications demand high computational performance. In large-scale computing, low-power compute devices help with thermal design and reduce the electricity bill. This paper presents a case study of a customized low-power vector processor design synthesized on a 28 nm process technology. The processor has a programmer-exposed datapath based on the transport triggered architecture programming model. The paper focuses on the RTL- and microarchitecture-level power optimizations applied to the design. Using register file datapath gating, register file banking, and clock gating of individual pipeline stages in pipelined function units, power and energy savings of up to one fourth were achieved with only a small area overhead. On top of this, for the measured radio applications, the exposed datapath architecture helped to achieve major power improvements in comparison to the traditional VLIW programming model by utilizing optimizations unique to transport triggered architectures.